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The detrended cross-correlation coefficient pdcca has recently been proposed to quantify the 
strength of cross-correlations on different temporal scales in bivariate, non-stationary time series. It 
is based on the detrended cross-correlation and detrended fluctuation analyses (DCCA and DFA, 
respectively) and can be viewed as an analogue of the Pearson coefficient in the case of the fluctuation 
analysis. The coefficient poccA works well in many practical situations but by construction its 
applicability is limited to detection of whether two signals are generally cross-correlated, without 
possibility to obtain information on the amplitude of fluctuations that are responsible for those 
cross-correlations. In order to introduce some related flexibility, here we propose an extension of 
Pdcca that exploits the multifractal versions of DFA and DCCA: MFDFA and MFCCA, respectively. 

The resulting new coefficient Pq not only is able to quantify the strength of correlations, but also 
it allows one to identify the range of detrended fluctuation amplitudes that are correlated in two 
signals under study. We show how the coefficient pq works in practical situations by applying it to 
stochastic time series representing processes with long memory: autoregressive and multiplicative 
ones. Such processes are often used to model signals recorded from complex systems and complex 
physical phenomena like turbulence, so we are convinced that this new measure can successfully be 
applied in time series analysis. In particular, we present an example of such application to highly 
complex empirical data from financial markets. The present formulation can straightforwardly be 
extended to multivariate data in terms of the q-dependent counterpart of the correlation matrices 
and then to the network representation. 

PACS numbers: 89.75.-k, 89.75.Da, 89.65.Gh, 02.70.Rr 


I. INTRODUCTION 

Standard correlation measures like the Pearson corre¬ 
lation coefficient and the cross-correlation function re¬ 
quire stationary data in order to provide reliable results, 
which is a requirement that is hard to fulhll in many 
real-world situations (the financial and physiological data 
are the negative examples here M). (By stationarity 
we mean stability of the probability distribution func¬ 
tions of the data over time; from this perspective non- 
stationarity can be produced both by the long-range au¬ 
tocorrelations and by the pdf’s heavy tails that make 
any signal length effectively insufficient.) This problem 
can to some degree be resolved by replacing original sig¬ 
nals with the corresponding detrended fluctuations, i.e. 
by considering the power-law correlations between the 
polynomially detrended walks, which are much more sta¬ 
tionary than the orginal data. This approach was in¬ 
troduced in a context of autocorrelations in m as the 
detrended fluctuation analysis (DFA) and immediately 
gained popularity among the researchers working with 
empirical data across many disciplines. The modifica¬ 
tion of DFA oriented towards detection of the power- 
law cross-correlations is known as the detrended cross¬ 
correlation analysis (DCCA) [l2| and it can be used in 
analysis of bivariate and multivariate empirical data El- 

Ill. 

Among the reasons behind the popularity of DFA was 


its ability of detecting fractal character of signals, which 
was subsequently extended to the multifractal case (the 
MFDFA method E3): which also proved very useful if 
applied to empirical data El“[ll|: especially owing to its 
superior reliability if compared to other methods [3l| . 
DCCA was also generalized in order to be applicable to 
signals with multifractal cross-correlations and the re¬ 
sulting MFDCCA/MFDXA algorithm also attracted 
some attention [391 - 14^ . However, this generalization 
raises controversy as at some stage it requires to ne¬ 
glect the detrended covariance sign to avoid obtaining 
complex values, which leads to inevitable loss of infor¬ 
mation about analyzed signals and, consequently, to in¬ 
correct results [i^. This drawback has recently been re¬ 
moved with a new sign-sensitive method of the multifrac¬ 
tal detrended cross-correlation analysis with an acronym 
MFCCA [ 4 ^ [ 4 ^ that is more consistent than MFDCCA 
as a multifractal generalization of DCCA (for more de¬ 
tails see Section II). 

Although the DFA and DCCA methods were designed 
to deal with nonstationary signals, they are related to 
each other exactly in the same way as the variance and 
the covariance analyses are related to each other in the 
case of stationary data. Therefore, by exploiting both 
these methods, an analogue of the Pearson coefficient 
was introduced. It is known as the detrended cross¬ 
correlation coefficient pdcca [Hi and serves as a tool for 
quantifying strength of correlations in the fluctuations 
of detrended signals at a given time scale (dSl - IH - The 
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main advantage of pocCA over the Pearson coefficient is 
its ability to quantify correlations in nonstationary sig¬ 
nals [i^. What is important is that the signals under 
study may either be fractal or non-fractal since pocCA 
is defined for a single scale. It is also worth mention¬ 
ing that there is a counterpart of pdcca constructed for 
the detrended moving average cross-correlation analysis 
(DMCA) [ 5 ^ but considering it here exceeds the objec¬ 
tives of this work. 

By definition, the detrended cross-correlation coeffi¬ 
cient is insensitive to higher-order statistics of fluctua¬ 
tions, i.e., other than the simple covariance. This means 
that values of pocCA cannot indicate whether the de¬ 
tected cross-correlations between two signals originate 
from the fluctuations of all amplitudes equally likely or 
rather some specific range of amplitudes gives a domi¬ 
nant contribution, while the remaining fluctuations can 
be much less correlated or even completely uncorrelated. 
One might easily imagine situations, in which such in¬ 
sensitivity can be considered a serious disadvantage of 
the method. For instance, let each nonstationary signal 
in a pair be a mixture of two processes, one of which is 
correlated in both signals and has a relatively small am¬ 
plitude, while the other is unique to each signal and has a 
relatively high amplitude. The coefficient pdcca would 
probably indicate that the signals are somehow cross- 
correlated but it would not bring any information allow¬ 
ing one to identify the amplitude of the cross-correlated 
components. 

In order to avoid such insensitivity, here we propose 
a generalization of the detrended cross-correlation coef¬ 
ficient in such a way that it becomes sensitive to corre¬ 
lations in the fluctuations of a selected amplitude range. 
The easiest way to do this is by introducing a g-dependent 
detrended cross-correlation coefficient pq {q e TV) based 
on the so-called g-dependent fluctuation functions Fq 
from MFDFA and MFCCA [13, SJ ■ The idea is based 
on the fact that from a number of different values con- 
tibuting to a sum, one can select specific values (e.g., the 
large, the medium, or the small ones) by rising all the 
values in the sum to some power (high positive, small 
positive, or negative one, respectively). Like pdcca, the 
coefficient pq is not related to fractal properties of signals 
so it can be used to quantify cross-correlations between 
any signals. 

As by its definition pq is intended to be a tool for 
analyzing nonstationary signals, we expect that it can 
find broad applications in the studies of empirical data 
from natural complex systems: physical, biological, so¬ 
cial, financial, etc. In the remaining part of the paper, 
we present the formal definition of pq (Section II), show 
examples of how it works if applied to the computer¬ 
generated signals representing different stochastic pro¬ 
cesses (Section III) and to empirical data from the finan¬ 
cial markets (Section IV), and finally present the main 
conclusions (Section V). 


II. THE g-DEPENDENT DETRENDED 
CROSS-CORRELATION COEFFICIENT 

A fundamental quantity for the detrended fluctuation 
analysis as well as for any type of its derivative methods 
is the variance (covariance) fzz (Ixy) of fho detrended 
signals X, Y {Z stands for either X or Y). Let us consider 
a pair of time series j, and j, divided 

into 2Ms separate boxes of length s (i.e.. Mg boxes start¬ 
ing from the opposite ends). A detrending procedure con¬ 
sists of calculating in each box v {v = Q ,..., 2Ms — I) the 
residual signals A, Y equal to the difference between the 
integrated signals and the mth-order polynomials 
fitted to these signals: 

i 

X^{s,i) = ^x{vs+j) - (1) 

i=i 

i 

Y^{s,i) = '^y{vs+j)-Pl,^]^{j). (2) 

i=i 

In the present work we use m = 2. The covariance and 
the variances of X and V in a box v are defined as: 

1 ® 

fxris, ^) = - Y] i)Yi,{s, i), (3) 

fzz{s,v) = -^Zl{s,i), ( 4 ) 

where Z again means either X ot Y. These quantities 
can be used to define a family of the so-called fluctuation 
functions of order q [13,113 : 

, 2Ms-l 

= ^ Y sign[/iy(s,i/)] |/|.y(s,^/)|'?/^ (5) 

® j /=0 

.. 2Ms-l 

^2 [fzzi^x)] -(6) 

^ I'—O 

The above definition of F^yi^) guarantees that: (i) no 
imaginary part occurs in F^y{s) (only the absolute val¬ 
ues are raised to a real power q/2) and (ii) by preserving 
signs of the covariances information is lost 

while taking the absolute values. For q = 2 the above 
definitions are reduced to a simpler form: 

.. 2Ms-l 

Txv('S) = J^), (7) 

® i/=0 

2Ms-l 

^ fzzi^x) (8) 

^ u=0 

that may be interpreted as an average covariance and av¬ 
erage variances for the boxes of size s. In the standard 
application of the fluctuation functions in MFDFA and 
MFCCA, one observes the dependence of F'^y and F^^ 
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on the scale s and looks for a convincing scaling behavior: 

^ and/or which in¬ 

dicates a fractal structure of the signals (monofractal for 
the constant functions: ^{q) = c and S{q) = c, multifrac¬ 
tal otherwise). 

The structure of Eqs. (I3|) and (|4]), which resembles the 
ordinary covariance and variance, respectively, suggested 
one to introduce the detrended cross-correlation coeffi¬ 
cient [45j |: 


Pdcca(s) 



(9) 


Owing to its normalized range of values [5l|: —1 < 
Pdcca < 1 with pdcca = 0 in the case of uncorre¬ 
lated signals, pdcca = 1 in the case of a perfect cross¬ 
correlation, and pdcca = — 1 in the case of a perfect 
anticorrelation, the coefficient pdcca(s) can be used to 
quantify the strength of cross-correlations between the 
detrended signals X, Y on different scales s and to com¬ 
pare this strength among different signal pairs 

The structure of Eqs. 0 and dH) indicates that all 
the boxes contribute to the fluctuation functions and 
the correlation coefficient with the same weight, irre¬ 
spective of how large (or how small) is /^y(s,z/) and 
in a particular box i/. This means that, by 
using solely the coefficient pdcca, it is impossible to ob¬ 
serve how the boxes that are characterized by the fluc¬ 
tuations of a specific amplitude range contribute to the 
overall cross-correlations. The following definition of a 
new g-dependent detrended cross-correlation (qDCCA) 
coefficient Pq{s) allows one to overcome this constraint: 


Pqis) 




( 10 ) 


as the real exponent q plays the role of a respective filter 
here. For q = 2 we restore the definition dH) of pocCA, 
for g > 2 the boxes with high values of /|.y(s,i/) and 
contribute to Pq{s) the most, while for q < 
2 the boxes with relatively small values do it the most 
significantly. The more deviated from the value q = 2 
the exponent q is, the more extreme fluctuations in the 
corresponding boxes contribute to the coefficient Pq{s). 
Note that no form of the s-dependence in Eqs. © and dll) 
has been assumed, so the signals under study do not need 
to be fractal at all. 

For g > 0, values of the pq coefficient are bound within 
the same range as pdcca, i-e., 

-1 <P, < 1. (11) 


In order to show this, we fix the scale s and prove the 
following Cauchy-Schwarz-like inequality: 

<Fxx{s)Ky{s)^ 9>0. (12) 

First, observe that from the relation: a^“ -I- > 2a“6“ 


{a,b > 0, a > 0) it follows for any two boxes z/, p that: 

2 [/a:a:('S) M)/Y'y(s, p)] < 

^ [/xa:(S) p)] + [/a:x('S, p)/yy(s, i^)] 

(13) 

since /^^(s, zz) > 0. Now, to simplify the notation, we 
temporarily neglect the signum function in Eq. © and 
assume that all the covariances /^y(s,z/) are positive. 
We then start from the l.h.s. of Eq. (HI: 


.. 2Ms-l 

inYi^)? = H [fxY{s,-)fy = 




2Ms-l 


4:M? 


E [llrM]' 


1^=0 


2Ms-l 2Ms-l 


< 


^ ^ ‘^[fxYi.S,v)fxY{s,p)y^ < 

s u=0 fi=v+l 

, 2M„-1 

U2 ^ [/a:x('57'^)/yy(s, l^)]+ 

qfi 

q/'^^ 


4M? 

® i/=0 


.. 2Ms-12Ms-l 

i[fxxis,jy)fYYis,P')Y^ + 

s i/=0 

.. 2Ms-12Ms-l 

4M2 ^ ^ [fxxis,’^)fYYis,P)Y^ = 


iP—O ^—0 


= F^xxis)F^Yis), 


where we exploited the Cauchy-Schwarz inequality: 

^ fxxi^’ i')I yy {s, i'), the implication: 
|a| < |6| ^ |a|« < |&|« for q>0, and Eq. (fl^. 

In general, the covariances can be negative 

and we must take this fact into consideration. Fortu¬ 
nately, the following relation always holds: 


2 M .-1 


\J F^xis)F^Yis) < IfxYis,’^)]'^^'^ < 

® j/=0 

lMs-1 

< —^ ^^S^[fxYis,12)]\fxYis,12)\F‘^ < 

r _i 


< 


1 

2M 


2M, 


1^=0 


which ends the proof of the inequalities (fT^ and dill) . 

For q < 0 the situation looks different, because the 
implication |a| < |6| ^ \a\'^ < \b\'^ is false in this case. 
This means that the denominator in Eq. m may be 
arbitrarily small as compared to the numerator modulus 
and Pq{s) can then assume either large positive or large 
negative values: |p 5 (s)| 1 for some scales s. This can 

be evident for uncorrelated or partially correlated signals. 
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while it is unlike for perfectly correlated ones. Therefore, 
the interpretation of the values of Pq{s) for g < 0 is a 
delicate issue that will be discussed in more detail in 
Section III. 


III. COMPUTER-GENERATED SIGNALS 

In this Section we present examples of the application 
of the new coefficient pq to the computer-generated time 
series representing different stochastic processes, in which 
the cross-correlations can fully be controlled, and then we 
discuss the performance of the coefficient in those cases. 
Since pq is oriented towards analysis of nonstationary 
data, we prefer to employ stochastic models that produce 
signals with long memory and/or heavy-tailed probabil¬ 
ity distribution functions. Long memory, which produces 
trends, and heavy tails of the fluctuations’ p.d.f.s, which 
can produce apparent trends, are the principal sources of 
nonstationarity in data from natural complex systems, 
therefore their presence in the model signals is highly 
desired. 

We focus on two models: the autoregressive frac¬ 
tionally integrated moving average (ARFIMA) and the 
Markov-switching multifractal (MSM). The former pro¬ 
duces fractal signals with longmemory and, despite its 
origin in financial economics |52l |. it is broadly used to 
model anomalous diffusion in various fields of science, 
like atmosphere physics and geophysics 1^, [sj, astro¬ 
physics [HI , biology and physiology [ 53 , and many 

other. The latter can be viewed as a version of a random 
walk in random time [H^. It is based on the stochas¬ 
tic multiplicative cascades and creates signals with long 
memory, heavy-tailed p.d.f., and with arbitrary length 
(unlike more typical cascading processes whose length 
is limited to the consecutive powers of the number of 
branches) [s^. MSM can be exploited to model various 
cascade-like phenomena, e.g., turbulence and finan¬ 
cial volatility |6l| and to predict future evolution of 
the corresponding observables. 


A. Perfectly cross-correlated signals 

First, we study behavior of Pq{s) for a pair of maxi¬ 
mally correlated signals. In order to prepare such signals, 
we use the ARFIMA process defined by the following 
formulas: 


x{i) =^aj{da:)x{i - j) + e{i) (14) 

00 

y(*) = '^a 3 {dy)y{i - j) + e{i), 

1=1 

where the parameters dz {z = x, y), fulfilling the condi¬ 
tion: — 1/2 < dz < 1/2, characterize the temporal range 
of the linear autocorrelations in a:(i), y(i) and are strictly 


related to the Hurst exponents: H = 1/2 -|- The 
quantity aj{dz) is called weight and defined by: 


aj{dz) 


r(j - dz) 
r{-dz)T{i + j)- 


(15) 


The time series x{i) and y{i) are correlated due to a 
common noise term e(i) being an i.i.d. Gaussian ran¬ 
dom variable. The ARFIMA signals are (mono)fractal, 

so their fluctuation functions [^’1^(5)]^^'^ form the fami¬ 
lies of power-laws presented in the upper panels of Fig. [T] 
The fractally cross-correlated nature of x{i) and y{i) can 
be seen in Fig.[TJc) as a family of parallel power-law func¬ 
tions ■ Careful inspection of the three panels: 

Fig. [T](a)-(c) suggests that the fluctuation amplitudes of 
the signals under study are maximally cross-correlated, 
so we may expect that a properly defined gDCCA coef¬ 
ficient should reflect this fact with its value being close 
to unity. Indeed, in Fig. [T](d), Pq[s) ~ I irrespective of s 
and q. This invariance can also be a consequence of the 
stationarity of the detrended ARFIMA processes. For 
signals with heavy tails of the pdfs (which preserve their 
non-stationary character after detrending), we would ex¬ 
pect more sizable deviations of Pq{s) from unity even if 
such signals were on average strongly cross-correlated. 
This is because large fluctuations that may occur in dif¬ 
ferent signals at different moments can have strong im¬ 
pact on covariance of the signals in the related boxes (up 
to the considerably large widths s), which can suppress it 
in these particular boxes and thus influence F^yis)- The 
higher |g| is, the stronger effect of this type may happen. 


B. Uncorrelated signals 


In order to test the gDCCA coefficient Pq{s) in the ab¬ 
sence of cross-correlations, we take the same signals as 
before: x(i) and y(i) and destroy their temporal struc¬ 
ture by random shuffling. However, before we show 
the complete results, we have to discuss the problem of 
|pq(s)| > 1 for g < 0. Fig. [2Ka)(b) shows |pq(s)| for (a) 
g = —4 and (b) g = —2. As one can see, its values 
can span a number of orders of magnitude with the more 
negative g is, the larger this range can be. The unbound 
character of Pq{s) makes any inference on the strength 
of the cross-correlations virtually impossible. As it has 
already been mentioned in Section H, the high values of 
|pg(s)| are produced if the denominator in Eq. (|TTH) is 
much smaller than the numerator. This situation can 
predominantly happen if the signals under study are not 
cross-correlated or are cross-correlated weakly. This sug¬ 
gests that a value of |p 5 (s)| that deviates much from I 
in any direction may be viewed as an indicator of a lack 
of cross-correlations. Taking this into consideration, one 
can redefine the gDCCA coefficient in the following way: 


N |P9(s) if |P9(s)| < 1 

\[pq{s)]-^ if \Pqis)\>l. 


( 16 ) 




FIG. 1: (Color online) (a)-(b) The g-dependent fluctuation 
functions for two ARFIMA time 

series sharing the same Gaussian noise term. A family of the 
fluctuation functions for different values oi q, q £ (—4,4) are 
shown for each time series (the topmost one represents g = 4). 
The linear dependence of on the 

double logarithmic plots indicates a fractal character of the 
analyzed signals, (c) The g-dependent cross-fluctuation func¬ 
tion [F'J.y (s)]^^'^ for the same pair of the time series indicating 
fractal cross-correlations between them, (d) The g-dependent 
detrended cross-correlation coefficient pq{s) for different val¬ 
ues of q. Note the almost perfect cross-correlation for any s 
and q. 

Now the p*q{s) coefficient remains always within the in¬ 
terval (—1,1) even \i q < 0. Two typical cases can be 
distinguished in this situation; (i) Fig. [2Kc) exhibits that 
for q = —A the coefficient p*{s) indicates the expected 
almost-zero level of cross-correlations; (ii) for q = —2 
(Fig. HKd)) Pg{s) is clearly non-zero, but it is highly un¬ 
stable with respect to s, bouncing between positive and 
negative values. Both these cases, if found in an analy¬ 
sis, may thus be considered the indicators of uncorrelated 
signals. For simplicity, from now on we will omit the star 
in the notation of the gDCCA coefficient: 

Pq{s)=Pq{s)- (17) 

Complete results of Pq{s) for the randomized ARFIMA 
signals are presented in Fig. [31 In the light of the above 
discussion, we are justihed to conclude that the analyzed 
signals are not cross-correlated at any scale s and at any 
q < 0. Since for g > 0 and s > 10^ we observe some 
deflections of Pq{s) from zero, we have to perform a test 
that can resolve whether these deflections are statisti¬ 
cally significant. In order to do this, we calculate Pq{s) 
for N = 10,000 pairs of the randomized ARFIMA sig¬ 
nals and estimate the dispersion of Pq{s). The related 
standard deviations ap{q,s) are shown in Fig. U) A di¬ 
rect comparison between Fig. [3] and Fig. S] leads us to 
a conclusion that the deflections from zero observed for 
<7 > 0 in Fig. [3] cannot be considered statistically signifi¬ 
cant. Although we show here the results for the ARFIMA 
processes only, the qualitatively similar results can be 



FIG. 2: (Color online) The g-dependent detrended cross¬ 
correlation coefficient pq as a function of the temporal scale 
s for sample values of g < 0: g = —4 (left) and g = —2 
(right). (Top) Scale dependence of the originally defined pq{s) 
for which values outside the standard interval (—1,1) are al¬ 
lowed (the modulus |pg(s)| is taken in order to show both the 
positive (black circles) and the negative (red/grey squares) 
values on the logarithmic axis). Dashed lines indicate the 
\pq\ = 1 level. (Bottom) The modified coefficient p*q[s) de¬ 
fined by Eq. (HU) with its values being in the interval (—1,1). 
Those values of pq[s) that were inverted to obtain pj(s) are 
denoted by circles. 

obtained for time series representing other examples of 
the uncorrelated statistical processes, which supports our 
claim that Pq{s) correctly estimates the level of cross¬ 
correlations between uncorrelated signals. 

There is a good moment here to stress that a necessary 
condition for obtaining the correct results (i.e., the lack of 
cross-correlations) shown in Fig. |3|is a proper definition 
of the g-dependent fluctuation function F^yis). Such a 
definition requires preserving the signs of the covariances 
fxY{s,u) exactly as it was done in Eq. ([3]). Unfortu¬ 
nately, it often happens in literature that those signs are 
neglected and only the moduli of the covariances are con¬ 
sidered (e.g., 0). What consequences can this approach 
have is illustrated in Fig. |SJ We take the same data as in 
Fig.|3]and calculate a modified coefficient p'q that uses the 
no-sign definition of F^yis). As one can see, for g > — 1 
this coefficient falsely indicates the presence of statisti¬ 
cally significant positive cross-correlations across all the 
scales, even though we know that such correlations can¬ 
not exist between the independently randomized signals. 

C. Partially cross-correlated signals 

The above examples of the almost perfectly correlated 
and the almost perfectly uncorrelated signals form the 
two extremes that are the least interesting trivial cases 
between which one can find a spectrum of much more in¬ 
teresting cases where the signals may reveal partial cross¬ 
correlations. For example, the correlations that are re¬ 
stricted to the signals’ specific components or that are 
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FIG. 3: (Color online) The coefficient Pq{s) calculated for a 
pair of the randomized ARFIMA time series. In each panel, 
Pg(s) for different g is displayed. Statistical significance of the 
results can be assessed by comparing these results with Fig.|4] 
The inverted values of pq{s) for g < 0 are denoted by circles. 

transient in time. We shall discuss here both these situ¬ 
ations. 

First, we test sensitivity of the gDCCA approach if 
the cross-correlations are transient. In order to do this, 
we take two cross-correlated ARFIMA time series (with 
Pq{s) ~ I as in Section IIIA) and randomize a part of 
the data points in one of these time series. After this 
operation, the modified time series consists of a fraction (j) 
of the original data points in its center and a fraction \ — (j) 
of the shuffled points. Sample results of these calculations 
are shown in Fig. [5] for different choices of (f>. It is clear 
that in this particular case the method gives statistically 
significant indication of the cross-correlated character of 
fluctuations for as low as (j> = 2% correlated data points 
in both signals. This indication is restricted to the small 
and medium temporal scales (s < 10^) only, however. In 
order to obtain similar significance for the larger scales 
up to s = 10®, one has to increase the correlated fraction 
to (/) > 0.1, so that s < (f>T, where T is the length of the 
time series. 

Second, we assume that each time series under study 
consists of different components that have different am¬ 
plitudes. Only some of these components are cross- 
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FIG. 4: (Golor online) Mean {pq{s)) (black solid lines) and 
standard deviation Op{q, s) (blue/grey lines with error bars) of 
the coefficient pq{s) calculated for 10,000 pairs of the shuffled 
ARFIMA time series. In each panel, the result for different g 
is displayed. 

correlated between the time series, while the other ones 
remain uncorrelated. For instance, a pair of time series 
may be cross-correlated only via large or via medium 
fluctuations and uncorrelated in small ones. We there¬ 
fore expect that the gDCCA analysis will allow us: (i) 
to identify the existence of the cross-correlations and (ii) 
to show, what fluctuation amplitudes carry these cross¬ 
correlations. 

The ARFIMA processes are not ideal for being a sub¬ 
ject of the present analysis since their fluctuations are 
essentially Gaussian distributed and the range of their 
amplitudes is rather small. The method works fine also in 
this case, but here a much more instructive example can 
be the non-stationary processes with heavy-tailed fluctu¬ 
ations, where the amplitudes of the large and the small 
fluctuations differ considerably. The heavy-tailed fluctu¬ 
ations often appear in empirical data recorded from both 
natural and social systems, so our choice is realistic and 
it does not limit the applicability domain of the method. 

Let us consider time series generated according to the 
Markov-switching multifractal model developed in [s^, 
1^ . This is a multiplicative, hierarchical model that 
is able to reproduce the multifractal characteristics of 
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FIG. 5: (Color online) The incorrectly defined coefficient 
p'q{s) (using the no-sign definition of in Efi- ®) cal¬ 

culated for the same pair of the randomized ARFIMA time 
series that was already used in Fig. [S] Observe that for 
q > — 1 the coefficient gives a false indication of positive cross¬ 
correlations. Apparent statistical significance of these results 
can be seen by comparing them with Fig. |4] 

some types of empirical data (for example, the finan¬ 
cial volatility). From the point of view of the present 
analysis, it is important that it produces the (unsigned) 
signals with heavy-tailed probability distribution func¬ 
tions, whose fluctuations can exhibit long-range, multi¬ 
scale auto-correlations. According to the MSM model, 
an observable x{i) is given by: 

x{i) = a{i)u{i), (18) 

where u{i) is a Gaussian random variable and i plays 
a role of discrete time. a{i) is called the instantaneous 
volatility and defined as a product of a constant factor 
a and k multipliers Mj(i) drawn from the binary or the 
lognormal distribution: 

k 

cr2(f) = cr^ nMj(f). (19) 

1=1 

In the binomial case, Mj(i) = [m, 2 — m], 1 < m < 2, 
while in the lognormal case: Mj{i) = LN{—X,2X) (^ . 



FIG. 6: (Golor online) Sensitivity test for the Pq{s) coeffi¬ 
cient calculated based on the ARFIMA time series (q > 0). 
An original time series 1 is compared with a modified time 
series 2, in which only a fraction <j) of the data points is pre¬ 
served, while the rest is randomized (black lines). The re¬ 
sults for the null-hypothesis of the uncorrelated time series 
are also shown for reference as the +/— standard deviation of 
100 independent realizations of the shuffled surrogates (ma- 
genta/grey lines symmetric with respect to pq = 0). 

Each multiplier Mj{i) changes its value at time i with 
probability: 

7, = l-(l-7fe)'"', J = (20) 

where 0 < 7 ^ < 1 and 6 = 2,3,4,... . Here we choose the 
cascades with fc = 10 levels and 6 = 2 branches. We also 
omit the Gaussian-distributed factor u{i) in Eq. as 
being unnecessary from the point of view of our analysis, 
because the complexity of x(i) is related to the instanta¬ 
neous volatility 

As a consequence of the stochastic character of the 
multiplier values, two independently produced signals 
would not be cross-correlated. A pair of the multiscale 
cross-correlated time series can nevertheless be obtained 
from an MSM time series by copying the set of its mul¬ 
tipliers and adding a small amount of noise to 

each one: + |ae(i)|, where e{i) is an 

i.i.d. Gaussian random variable and a <C 1. 

Our objective is to show that the qDGGA coefficient 
Pq{s) can provide us with more information on the cross¬ 
correlation structure of these time series that the ordi¬ 
nary coefficient pdcca(s) can. We generate two lognor¬ 
mal MSM time series of length T = 10® with A = 1.1. 
The parameter a = 0.01 guarantees that their detrended 
fluctuations are cross-correlated. Indeed, Fig. [TKa) dis¬ 
plays that for q G (—4,4) the coefficients Pq{s) detect 
strong cross-correlations (for q > 0 even the maximum 
ones over all the scales). What then happens if we apply 
a filter that randomizes all the data points with small 
amplitude x{i) < 0.01? Action of this filter is equiva¬ 
lent to removing the cross-correlations among the small 
fluctuations and preserving the ones among the medium 

























FIG. 7: (Color online) The gDCCA coefficient Pq{s) calcu¬ 
lated for a pair of time series from the lognormal Markov- 
switching multifractal (MSM) model with fc = 10, fe = 2, 
A = 1.1, and a — 0.01 (see text for the description of these 
parameters). Four selected values of q are shown denoted by 
different lines, q ~ 2 corresponds to the standard coefficient 
Pdcca. (a) The original time series, (b) Both time series are 
filtered (see text) in order to remove the correlations among 
the small fluctuations (x(i) < 0.01). (c) The randomized data 
(single realization of the shuffled surrogates) without any tem¬ 
poral correlations, (d) Both time series are filtered in order to 
remove the correlations among the medium and large fluctu¬ 
ations (x(i) > 0.01). For clarity of the pictures, the inverted 
values of Pg(s) are not distinguished. 


and large fluctuations. Fig. [TKb) presents the results ob¬ 
tained for such filtered signals. One can see that while 
the cross-correlations at the longer scales (roughly, for 
s > 10^) are still detected by Pq(s) for any g, for g < 1 the 
gDCCA coefficients indicate a significant decrease of the 
cross-correlations at the shorter scales (s < 10^). This 
decrease is more explicit for g < 0 and for g = —4 the 
corresponding coefficient indicates that the signals are 
completely uncorrelated. As such small values of g are 
related to very small fluctuations, this result is exactly 
what one might expect for the filtered signals. On the 
other hand, for g > 2 the filtering does not introduce any 
change in Pg(s) and this is also perfectly compliant with 
the a priori expectation that for the large fluctuations 
the cross-correlations should survive. Now let us look at 
the results for g = 2 in Fig. [7Ka)(b): a change is hardly 
seen. Thus, in this case the pdcca coefficient proves to¬ 
tally insensitive to a significant change in the correlation 
structure of the data. 

Another example of such an evident lack of sensitivity 
of Pdcca can be seen in bottom panels of Fig. [T] If the 
same MSM time series as above are completely shuffled, 
all their temporal correlations are destroyed. Fig. [3(c) 
shows that the coefficients Pq{s) do not detect any cross¬ 
correlations accordingly. The results for the large scales 
are within statistical error even if they are far from zero; 
among them the coefficient for g = 2 is almost flat for the 
majority of scales except the largest ones. These results 


can be compared with the results obtained for the filtered 
signals (Fig. |3(d)), in which only the correlations among 
the small fluctuations {x{i) < 0.01) are preserved and 
all the other correlations are removed via data shuffling. 
Now the coefficients Pq{s) indicate the presence of cross¬ 
correlations for the medium scales (for g = 0.25, g = 
— 1, and g = —4) and for the small scales (for g = 0.25 
and g = —1). Such values of g correspond to the small 
fluctuations. In contrast, for g = 2 the related coefficient 
does not show anything signihcant and behaves roughly 
similar to the no-correlation case of Fig. IlKc). 

The two further examples illustrating usefulness of the 
gDCCA coefficient are obtained from the binary MSM 
model producing a pair of the A:=20-level binary mul¬ 
tiplicative cascades with the g-level multipliers Mj{i) 
drawn from the set [1.2,0.8] and the set [1.25,0.75], 
respectively. The resulting time series are of length 
T = 1,048,576 and their values belong to the inter¬ 
val (0.01,100). In order to prepare the top panels of 
Fig. Illa)(b), we modify both time series in such a way 
that we leave the original values if they do not exceed 
a threshold of a; = 1.25 and randomize the values above 
this threshold. Then we create copies of the so-modihed 
time series and once again randomize their values if they 
are below x = 0.15. By doing this, we get the two pairs 
of signals: (i) a pair of signals without the largest fluctu¬ 
ations, (ii) a pair of signals without the largest and the 
smallest fluctuations. Now we compare the coefficients 
Pq{s) calculated for these pairs. First, we look at p 2 (s) 
(black solid lines). In both top panels, we see that P2{s) 
deviates significantly from unity for the scales s < 10^, 
which suggests that the cross-correlations are not uni¬ 
form across the fluctuation amplitudes. However, based 
solely on g = 2 we cannot state decisively, which fluc¬ 
tuations correspond to this decreased correlations. As 
regards a comparison between the panels (a) and (b), 
there is some visible difference in p2{s) for the middle 
scales (10^ < s < 10"^), but again one cannot decide 
what is the origin of this difference. 

Let us now include q ^ 2. For g = 4 the values of 
Pq{s) in both panels are even smaller than their coun¬ 
terparts for g = 2 and this is especially strong for the 
smallest scales s < 10^. The opposite relation is seen 
for g = 0.25, where po. 25 (s) is much larger than P 2 {s)- 
This means that by increasing g we obtain a decrease of 
Pqis), which can correctly be interpreted as a manifesta¬ 
tion of uncorrelated behavior of the largest fluctuations 
in both pairs of the signals. If we then compare the co¬ 
efficients for g = 0.25 and g = —1, we realize that they 
are systematically larger in (a) than in (b). Even for 
g = —2 the instability of P- 2 (s) suggest a lack of cross¬ 
correlations. All this means that the small fluctuations 
are weaker cross-correlated in (b) than in (a), exactly as 
it is expected from the signal construction in both cases. 

Next we once more use the time series generated by 
the binary MSM model. We randomize the fluctuations 
in the middle range of the amplitudes: 0.25 < x < 0.75, 
which comprises the most frequent fluctuations in both 
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FIG. 8: (Color online) The gDCCA coefficient Pq{s) cal¬ 
culated for a pair of time series from the binary Markov- 
switching multifractal (MSM) model with fc = 20 and Mj {i) = 
[1.25,0.75] (cascade no. 1) or Mj{i) = [1.2, 0.8] (cascade no. 
2). Different values of q are denoted by different lines. The 
case of q = 2 corresponds to the standard DCCA coefficient 
Pdcca. (Top) Pq{s) for (a) a pair of time series with random¬ 
ized fluctuations of large amplitudes x > 1.25 and for (b) a 
pair of time series with randomized fluctuations of large am¬ 
plitudes X > 1.25 and small amplitudes x < 0.15. (Bottom) 
Pg(s) for (c) a pair of time series with randomized fluctua¬ 
tions of medium amplitudes 0.25 < x < 0.75 and for (d) a 
pair of time series with randomized fluctuations of small and 
medium amplitudes x < 0.75. For clarity of the pictures, the 
inverted values of pq{s) are not distinguished. 


time series, so that only the small and the large fluctua¬ 
tions remain cross-correlated. The results for this pair of 
signals are shown in Fig. [8l[c). A copy of each time series 
is created and now the small fluctuations are a subject 
of additional randomization. Thus, we obtain the second 
pair of time series with only the large fluctuations that 
are still cross-correlated (Fig. [51[d)). The difference be¬ 
tween these pairs is the existence (c) and the lack (d) of 
the cross-correlations among the fluctuations with small 
amplitude. On the one hand, for q = 2 and q = 4 we do 
not observe any difference in Pq{s) between (c) and (d) as 
they are close to unity in both panels. On the other hand, 
the values of Pq{s) for q = —1 and q = —4 are substan¬ 
tially smaller for small s in (d) than in (c). Taking these 
results altogether, one can correctly conclude that both 
signal pairs are cross-correlated in the large fluctuations 
and uncorrelated in the small ones. It has to be stressed 
that drawing this conclusion would be impossible if we 
restricted our analysis to the q = 2 case of pocCA- (The 
uncorrelated character of the fluctuations of the medium 
amplitudes can be revealed in other comparative analy¬ 
ses, but here in this example such a range of amplitudes 
is beyond our interest.) 


IV. EMPIRICAL DATA 

Financial data is known to be cross-correlated across 
different temporal scales [l^, S, , so it can serve as 
a suitable subject to show practical application of the 
qDCCA coefficient. We choose data from two large finan¬ 
cial markets: the American stock market represented by 
the stocks traded at New York Stock Exchange (NYSE) 
or NASDAQ and the foreign currency exchange mar¬ 
ket (Eorex), which is a global market. In the former 
case, our data set comprises high-frequency recordings of 
the stock prices corresponding to the 100 largest Amer¬ 
ican companies over the years 1998-99. For each con¬ 
sidered stock i, the corresponding time series represents 
the logarithmic price increments (returns) ri{t^ At) = 
log Pi {t + At) — log Pi {t) sampled at a fixed time interval 
At = 5 min. The length of such time series is T = 40, 638 
data points. The sample results for two pairs of stocks: 
(a) Microsoft—Intel and (b) Microsoft—3M are displayed 
in Fig. [5] and tested against the null hypothesis that the 
cross-correlations are random (standard deviation of 100 
independent realizations of the shuffled surrogates). 

For q < —2 corresponding to small fluctuations, 
there is no statistically significant indication of cross¬ 
correlations between any pair of stocks. Genuine cross¬ 
correlations are however visible starting from q = —1 
up to high positive qs. One can see here that even if 
a vast majority of the Pq{s) values are originally larger 
than 1 (see the red dots representing the inverted values 
of the coefficient for q = — 1), the cross-correlations may 
be considered genuine as long as the function Pq{s) is 
approximately stable (i.e., without strong fluctuations) 
over some range of the scales. 

It is interesting to notice the weakening of the cross¬ 
correlations if one goes from small positive qs towards 
larger ones. It may be interpreted as a manifestation of 
the fact that the most cross-correlated fluctuations are 
those of the medium amplitudes, while the largest fluc¬ 
tuations are less correlated among the stocks. One can 
also infer that in both examples, the coefficients Pq{s) 
are larger for the small scales and smaller for the large 
scales. However, it can be the opposite for different choice 
of the stocks (not shown) so there is no regularity here. 
By looking at both parts of Fig. [9l one can see that 
there is an important difference between them: the cor¬ 
relations are much stronger (larger values of Pq{s)) for 
the stocks representing mutually related industrial sec¬ 
tors (Microsoft—Intel, Fig. [51[a)), while they are weaker 
for the stocks from unrelated sectors (Microsoft—3M 

Fig.EIb)). 

That this is a systematic effect one can learn from 
Fig. [ini where three groups of the stock pairs are consid¬ 
ered, representing different levels of the industrial sim¬ 
ilarity between companies: (i) the stocks corresponding 
to different industrial sectors, (ii) the stocks represent¬ 
ing the same sector but different subsectors, and (iii) the 
stocks corresponding to the same subsector (according 
to [1^). Obviously, the similarity level increases here 


























FIG. 9: (Color online) The gDCCA coefficient Pq{s) calcu¬ 
lated for two pairs of time series representing stock price log¬ 
arithmic returns sampled with At — 5 min frequency. (Top) 
Two stocks from the same industrial sector: Microsoft vs. In¬ 
tel. (Bottom) Two stocks from different industrial sectors: 
Microsoft vs. 3M. Each panel for both pairs shows Pq{s) cal¬ 
culated for different value of q (heavy black line) together 
with the mean {pq{s)) and the standard deviation ap{q,s) 
obtained from 100 independent realizations of the shuffled 
surrogate data (thin blue/grey lines with error bars). The 
inverted values of pq{s) for g < 0 are denoted by circles. 



FIG. 10: (Color online) The average gDCCA coefficients 
(pg(s)) calculated for the stock pairs divided into three 
groups: the stocks representing different industrial sectors 
(solid black), the stocks representing the same sector but 
different subsectors (dashed red/grey), and the stocks repre¬ 
senting the same subsector (dash-dotted green/grey). Dotted 
lines denote the zero-correlation level. 


gradually from (i) to (iii). We calculate Pq{s) for each 
pair of stocks (4950 pairs total: 4218 pairs in (i), 548 
pairs in (ii), and 184 pairs in (iii)) and then average 
the obtained functions over the pairs within each group. 
Fig. [TOl confirms that the values of {pq{s)) systematically 
increase if the pairs consist of the stocks with increasing 
industrial similarity. 

It is worth to mention that the correlations that lead 
to the non-zero values of Pq{s) are nonlinear. We infer 
this from a failed test of the null hypothesis that the 
surrogate signals with the same Pearson autocorrelation 
function can reproduce the Pq{s) functions. Fig. [TT] doc¬ 
uments the results of this test, showing that, on average, 
the Fourier-phase shuffling leads to complete destruction 
of the correlations that pq is sensitive to (compare with 
Fig. [SKa)). This is especially evident for the positive qs, 
for which the standard deviation ap{q,s) obtained from 
100 independent realizations of the surrogates is rela¬ 
tively small, especially for the small and medium scales s. 
For the large scales the MFDFA and MFCCA procedures 
are less efficient in detrending (the related polynomial de¬ 
gree TO = 2 is too low in this case) and this produces the 
non-zero deviations of {pq{s)) observed in Fig. ITT] 
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FIG. 11: (Color online) The average gDCCA coefficient 
{pq{s)) and its standard deviation ap{q,s) calculated for the 
Microsoft—Intel pair (the same as in Fig. EKtop)) after ran¬ 
domizing the Fourier phases of the original signals, which de¬ 
stroyed all the statistical dependences except for the Pearson 
autocorrelation function. 100 independent realizations of such 
surrogate data are used (thin blue/grey lines with error bars). 


The second empirical example comes from the Forex. 
We consider high-frequency (At=l min) logarithmic re¬ 
turns of the exchange rates among a set of major curren¬ 
cies recorded over the years 2004-2008. Fig. [T^ presents 
Pq{s) calculated for two such time series corresponding to 
the USD/EUR and GBP/USD rates. Since euro and the 
British pound are positively cross-correlated (both are 
currencies used in the European Union countries) and 
since the US dollar appears in the numerator of the first 
rate and in the denominator of the second rate, these 
time series reveal negative cross-correlations of their de¬ 
trended fluctuations. They are the strongest for g = — 1 
and g = 1, while their strength decreases if we move 
away from these values in both directions. One can 
also see in the topmost panels of Fig. [1^ that even for 
g < — 2 there is statistically significant indication of gen¬ 
uine cross-correlations for large s (if compared with the 
standard deviation of the shuffled surrogates). We infer 
from these results that the most strongly cross-correlated 
fluctuations of the Forex data are those of the medium 
amplitudes. The large-amplitude fluctuations are rela¬ 
tively weaker but still substantially cross-correlated and 
even the fluctuations with the small amplitudes reveal 


FIG. 12: (Golor online) The gDCCA coefficient Pq{s) cal¬ 
culated for a pair of time series representing returns of the 
USD/EUR and GBP/USD exchange rates sampled with At = 
1 min frequency. Each panel shows Pq{s) calculated for differ¬ 
ent value of (heavy black line) together with the mean {pq{s)) 
and the standard deviation ap{q,s) obtained from 100 in¬ 
dependent realizations of the shuffled surrogate data (thin 
blue/grey lines with error bars). The inverted values of pq{s) 
for g < 0 are denoted by circles. 

weak cross-correlations (unlike their uncorrelated coun¬ 
terparts from the stock market in Fig. |9]). 

V. SUMMARY 

We proposed a new measure of the multiscale de¬ 
trended cross-correlations between a pair of time series, 
called the gDCCA coefficient. This coefficient forms ac¬ 
tually a family of functions Pq{s) that depend on the 
exponent g and the temporal scale s. Like in the case of 
the multifractal, g-dependent fluctuation functions, by 
varying a value of g, we can amplify those fluctuations 
that are of specihc amplitudes: e.g., the small ones for 
g <?C 0 or the large ones for g 0, and study the struc¬ 
ture of the cross-correlations among such fluctuations. A 
specific case of pq for g = 2 corresponds to the DCCA 
coefficient pdcca already known from literature [i^, so 
from this perspective the pq coefficient may be viewed 
as its generalization. For g > 0, the interpretation of pq 
is straightforward as its value is confined in the range 
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~1 S Pij ^ the maximum value of pq = 1 means that 
the corresponding detrended fluctuations of both signals 
under study show exactly the same correlation structure, 
Pq = —1 means that such fluctuations are completely 
anticorrelated, and pq = 0 indicates that they are inde¬ 
pendent of each other. For g < 0, on the other hand, the 
coefficient \pq\ may assume arbitrarily large values, which 
can lead to interpretation problems, but this may happen 
especially if the detrended fluctuations of the two signals 
are uncorrelated. Knowing that, we may be concerned 
about the exact values of Pq in this case. For practical 
reasons, we overcome this problem by replacing the orig¬ 
inal values \pq\ > 1 by their inversion and thus forcing 
them to ht into the normal interval (—1,1). In the case 
of no correlations, a course of the function Pq{s) becomes 
strongly unstable and its values wildly fluctuate. This 
effect can thus serve as an optical indication of the sig¬ 
nal independence. In contrast, if the signal fluctuations 
are cross-correlated, we obtain largely stable behavior of 
Pq{s) even for g < 0 and even if the original values of 
|pg(s)| slightly exceed 1. Therefore, in this case a lack of 
wild fluctuations of Pq{s) and its non-zero value can sig¬ 
nalize the existence of cross-correlations. All this means 
that the family of the coefficients pq is a well-defined 
tool of the multiscale detrended cross-correlation analy¬ 
sis. Obviously, because of the more demanding interpre¬ 
tation of results for g < 0, one may restrict an analysis 


to the g > 0 case, but this is equivalent to ignoring the 
cross-correlation structure of small fluctuations, so we do 
not recommend it. 

In order to illustrate the performance of this new mea¬ 
sure, we applied it to a few selected data sets represent¬ 
ing long-memory processes: ARFIMA and the Markov- 
switching multifractal. We showed that pq is able to 
identify correctly specihc cross-correlations that remain 
undetected by the standard DCCA cross-correlation co¬ 
efficient pdcca as well as it is able to distinguish be¬ 
tween the processes whose detrended fluctuations are 
cross-correlated in different amplitude ranges — a task 
that is infeasible with pdcca- We also carried out 
the Pq analysis based on sample empirical data from 
the financial markets and found that the detrended 
cross-correlations among such data also depend on the 
detrended-fluctuation amplitude. Straightforward and 
likely fruitful for a more global correlation analysis would 
be to form the g-dependent counterparts of the conven¬ 
tional correlation matrices for larger sets of multivariate 
data and to study their spectral properties as it is com¬ 
monplace in the standard correlation matrix analysis. By 
proceeding further in this direction, one may then also 
consider constructing the g-dependent graphs. On the 
other hand, a parallel approach would be defining the g- 
generalized version of the DMCA coefficient pdmca 0 
in a full analogy to our approach. 
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